MOMENT BOUNDS FOR NON-LINEAR FUNCTIONALS OF THE PERIODOGRAM

Abstract. In this paper, we prove the validity of the Edgeworth expansion of the discrete
Fourier transforms of some linear time series. This result is applied to approximate
moments of non-linear functionals of the periodogram. As an illustration, we give an
expression of the mean square error of the Geweke and Porter-Hudak estimator of the long
memory parameter. We prove that this estimator is rate optimal, extending the result of
Giraitis, Robinson, and Samarov [1997] from Gaussian to linear processes.



Keywords : linear processes, discrete Fourier transform, periodogram, long range dependence, 
Geweke and Porter-Hudak (GPH) estimator. 

1. Introduction 

Many estimators in time series analysis involve non-linear functionals of the periodogram.
Examples include the estimation of the innovation variance [Chen and Hannan, 1980, Lee, Cho,
Kim, and Park, 1995, Deo and Chen, 2000, Ginovian, 2003], log-periodogram regression
[Taniguchi, 1979, 1991, Shimotsu and Phillips, 2002], and robust non-parametric estimation of
the spectral density [von Sachs, 1994, Janas and von Sachs, 1995]. Non-linear functionals of
the periodogram also play a predominant role in the analysis of long-memory time series: one
of the most widely used estimators of the memory parameter is based on the regression of the
log-periodogram ordinates on the log-frequency [Geweke and Porter-Hudak, 1983, see also
Robinson, 1995b, Moulines and Soulier, 1999].

The statistical analysis of such functionals has proved to be a very challenging problem, due
to the intricate dependence structure of periodogram ordinates. The first attempts to study
these statistics were made under the additional assumption that the underlying process is
Gaussian. Because the Fourier transform coefficients are in this case also Gaussian, one may
then apply results on non-linear transforms of Gaussian random variables; see for example
Taqqu [1977], Taniguchi [1980] and Arcones [1994].

These techniques do not extend to non-Gaussian processes. A first step to weaken this
assumption was taken by Chen and Hannan [1980], who proved the consistency of an additive
functional of the log-periodogram of a linear stationary process, with an application to the
estimation of the innovation variance. Their approach was based on the so-called Bartlett
[1955] expansion; this technique was later improved by Faÿ, Moulines, and Soulier [2002], who
proved a central limit theorem for these functionals. It was used by Velasco [2000] to
establish the weak consistency of the log-periodogram regression estimate of the long memory
parameter for long range dependent linear time series. Edgeworth expansions are used to
estimate moments of the functional of the unobservable periodogram of the innovation sequence.
Remainder terms can be bounded in probability. The Bartlett expansion is indeed useful to
establish limit theorems but does not in general allow one to determine the moments of these
functionals.

An alternative approach has been considered by von Sachs [1994] and Janas and von Sachs
[1995]. These authors prove the mean-square consistency of general additive functionals of
non-linear transforms of the (tapered) periodogram, using Edgeworth expansions of the discrete
Fourier transform of the observed time series itself. Janas and von Sachs [1995] apply these
results to prove the mean-square consistency of a Huberized (peak insensitive) non-parametric
spectral estimator. These results rely on the Edgeworth expansion of a triangular array of
strongly mixing processes with geometrically decaying mixing coefficients established by Götze
and Hipp [1983]. The mixing conditions therein are rather stringent, and thus the conclusions
reached by Janas and von Sachs [1995] are proved under a set of restrictive assumptions,
precluding for instance their use in a long-memory context.

The main objective of this paper is to develop a method for computing the moments of
functionals of non-linear transforms of the (possibly tapered) periodogram of a linear process.
These results are based on the Edgeworth expansion of a (possibly infinite) triangular array of
i.i.d. random variables obtained earlier in Faÿ, Moulines, and Soulier [2004] and recalled in
Appendix A. The linearity of the process is then crucial. Our results cover both short-memory
and long-memory processes.

The remainder of the paper is organized as follows. In Section 2 we give the assumptions on
the linear structure of the time series and define the (cumbersome) notations related to
Edgeworth expansions. In Section 3 we formulate the validity of Edgeworth expansions and
moment bounds under a short-memory set of hypotheses. As an application, we derive the
mean-square consistency of additive functionals of non-linear transforms of the periodogram for
a short-memory linear time series. In Section 4 we follow the same lines but in a long-range
dependence framework, and apply the moment bounds we obtain to control the mean-square error
of the Geweke and Porter-Hudak [1983] estimator of the fractional difference parameter for a
non-Gaussian linear long-memory process. This extends the rate optimality property of the
Geweke and Porter-Hudak (hereafter, GPH) estimator obtained earlier by Giraitis, Robinson, and
Samarov [1997] for Gaussian processes. A small Monte Carlo experiment is run to confirm our
results for finite-sample observations. Proofs are postponed to the appendices.



2. Notations and assumptions 

Assume that $X = (X_t)_{t\in\mathbb{Z}}$ is a covariance stationary process that has a spectral
density $f$. For any integer $r \ge 0$, we define the tapered discrete Fourier transform (DFT)
and periodogram of order $r$ as

$$d_{r,n}(\lambda) = (2\pi n a_r)^{-1/2} \sum_{t=1}^{n} h_{t,n}^{r} X_t e^{it\lambda}, \qquad I_{r,n}(\lambda) = |d_{r,n}(\lambda)|^2, \tag{2.1}$$

where $h_{t,n} = 1 - e^{2i\pi t/n}$ is the data taper introduced in Hurvich and Chen [2000] and
$a_r \stackrel{\mathrm{def}}{=} n^{-1}\sum_{t=1}^{n} |h_{t,n}|^{2r} = \binom{2r}{r}$ is a normalization factor.
Denote $d_{r,n,k} = d_{r,n}(\lambda_k)$ and $I_{r,n,k} = I_{r,n}(\lambda_k)$ the tapered DFT and tapered
periodogram evaluated at the Fourier frequencies $\lambda_k = 2\pi k/n$, $k = 1, \dots, [(n-1)/2]$.
Define for $r \in \mathbb{N}$, $D_{r,n}(\lambda)$ the normalized kernel function

$$D_{r,n}(\lambda) = (n a_r)^{-1/2} \sum_{t=1}^{n} h_{t,n}^{r} e^{it\lambda} = (n a_r)^{-1/2} \sum_{k=0}^{r} \binom{r}{k} (-1)^k D_n(\lambda + \lambda_k) \tag{2.2}$$

where $D_n(\lambda) = \sum_{t=1}^{n} e^{it\lambda}$ denotes the non-symmetric Dirichlet kernel. The latter
relation implies that $D_{r,n}(\lambda_k) = 0$ for $k \in \{1, \dots, \tilde n\}$, with
$\tilde n = [(n - 2r - 1)/2]$, so that the tapered Fourier transform is invariant to shift in the
mean. As shown in Hurvich and Chen [2000], the decay rate of the kernel in the frequency domain
increases with the kernel order, namely

$$\forall \lambda \in [-3\pi/2, 3\pi/2], \qquad |D_{r,n}(\lambda)| \le \frac{C n^{1/2}}{(1 + n|\lambda|)^{r+1}}. \tag{2.3}$$

This property means that higher order kernels are more effective at controlling frequency
leakage. If $X$ is a white noise and $r = 0$, the DFT ordinates at different Fourier frequencies
are uncorrelated. This property is lost by tapering. More precisely, for
$1 \le k \ne j \le \tilde n$, $E[d_{r,n,k}\, d_{r,n,j}] = 0$ and
$E[d_{r,n,k}\, \bar d_{r,n,j}] = (2\pi)^{-1} \varsigma_r(k - j)$, where $\bar z$ denotes the complex
conjugate of $z$ and $\varsigma_r$ is defined in (3.6).
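
As a numerical illustration of the definitions above, the following Python sketch evaluates the
tapered DFT (2.1) by direct summation and checks three facts stated in this section: the value
$a_r = \binom{2r}{r}$, the invariance to a shift in the mean at the frequencies
$1 \le k \le \tilde n$, and the banded correlation of the tapered ordinates of a white noise.
The function names are hypothetical, and the closed form coded in `varsigma` is an assumption of
this sketch (it is obtained by expanding $|h_{t,n}|^{2r}$ with the binomial theorem), not a
quotation of the paper.

```python
import numpy as np
from math import comb


def taper(n, r):
    """Data taper h_{t,n}^r = (1 - exp(2 i pi t / n))^r for t = 1..n."""
    t = np.arange(1, n + 1)
    return (1.0 - np.exp(2j * np.pi * t / n)) ** r


def tapered_dft(x, r):
    """d_{r,n}(lambda_k), k = 0..n-1, by direct O(n^2) summation as in (2.1)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    h = taper(n, r)
    a_r = np.mean(np.abs(h) ** 2)            # normalization factor a_r
    t = np.arange(1, n + 1)
    lam = 2.0 * np.pi * np.arange(n) / n     # Fourier frequencies lambda_k
    phases = np.exp(1j * np.outer(lam, t))   # e^{i t lambda_k}
    return phases @ (h * x) / np.sqrt(2.0 * np.pi * n * a_r)


def white_noise_cov(n, r, k, j):
    """E[d_k conj(d_j)] for a unit-variance white noise (exact finite sum)."""
    w = np.abs(taper(n, r)) ** 2             # |h_{t,n}|^{2r}
    t = np.arange(1, n + 1)
    lam_diff = 2.0 * np.pi * (k - j) / n
    return np.sum(w * np.exp(1j * t * lam_diff)) / (2.0 * np.pi * np.sum(w))


def varsigma(r, m):
    """Assumed closed form: (-1)^m C(2r, r+|m|) / C(2r, r), and 0 when |m| > r."""
    if abs(m) > r:
        return 0.0
    return (-1) ** abs(m) * comb(2 * r, r + abs(m)) / comb(2 * r, r)


n, r = 64, 1
print(2.0 * np.pi * white_noise_cov(n, r, 11, 10).real, varsigma(r, 1))
```

With a taper of order $r$, ordinates more than $r$ Fourier frequencies apart remain
uncorrelated for a white noise, which is what the frequency-spacing conditions used later in the
paper exploit.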

Many statistical applications (see the references given in the Introduction) require the study
of weighted sums of non-linear functionals of the periodogram ordinates

$$T_n(X, \phi) = \sum_{k=1}^{K} \beta_{n,k}\, \phi\!\left(\frac{I_{r,n,k}}{f(\lambda_k)}\right) \tag{2.4}$$

where $(\beta_{n,k})_{k\in\{1,\dots,K\}}$ is a triangular array of real numbers. If $X$ is a Gaussian
white noise, then $(I_{r,n,k})$ are i.i.d. and the moments of the sum $T_n(X, \phi)$ can be calculated
explicitly. In any other case, the random variables $(I_{r,n,k})_{k\in\{1,\dots,K\}}$ are not
independent, and the calculation of the moments of $T_n(X, \phi)$ is a difficult problem. The only
attempt to solve it has been made by Janas and von Sachs [1995], who proposed a technique to
compute moments of order 1 and 2. As already outlined, their results are based on mixing
conditions, precluding their use for long-memory processes.

Remark. Sometimes the periodogram ordinates are averaged along blocks of adjacent frequencies.
This technique is known as pooling and is appropriate to reduce the asymptotic variance of the
estimators of non-linear functionals of the periodogram (see Robinson [1995b,a]). For
simplicity, we will not present any explicit result or application with the pooled periodogram,
but the Edgeworth expansion results that follow also allow one to derive moment bounds on
functionals of the tapered and pooled periodogram.

In this contribution, we focus on non-Gaussian strict sense linear processes, i.e. it is
assumed that

$$X_t = \sum_{j\in\mathbb{Z}} \psi_j Z_{t-j}, \qquad \sum_{j\in\mathbb{Z}} \psi_j^2 < \infty, \tag{2.5}$$

where $(Z_j)_{j\in\mathbb{Z}}$ is a sequence of i.i.d. random variables such that $E[Z_1] = 0$,
$E[Z_1^2] = 1$. In addition, for some $s \ge 3$, $p \ge 1$ and $p' \ge 0$,

(A1) $E[|Z_1|^s] < \infty$ and $\int_{-\infty}^{\infty} |t|^{p'}\, |E[e^{itZ_1}]|^{p}\, dt < \infty$.

Remark. Apart from a classical moment condition, (A1) supposes that the distribution of the
i.i.d. noise is smooth; for example, lattice distributions are forbidden. This condition is
stronger than the usual Cramér condition. It ensures that the distributions of the Fourier
coefficients of $Z$ are eventually continuous. We need this continuity to bound moments of
singular functionals of the periodogram. Note that this condition could be dispensed with,
were we concerned with smooth functionals.

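Define $\psi(\lambda) = \sum_{j\in\mathbb{Z}} \psi_j e^{ij\lambda}$ (the convergence holds in
$L^2([-\pi, \pi], dx)$) the transfer function of the linear filter $(\psi_j)_{j\in\mathbb{Z}}$ and
$f(\lambda) = (2\pi)^{-1}|\psi(\lambda)|^2$ the spectral density of the process $X$. For an integer
$k \in \{1, \dots, \tilde n\}$ such that $f(\lambda_k) \ne 0$, define the normalized DFT
$\omega_{r,n,k} = \sqrt{2\pi}\, d_{r,n,k}/|\psi(\lambda_k)|$. Let $k_1 < k_2 < \cdots < k_u$ be an ordered
$u$-tuple of such integers in the range $1, \dots, \tilde n$ and write $\mathbf{k} = (k_1, \dots, k_u)$.
Define (the reference to $r$ is suppressed in the notation)

$$S_n(\mathbf{k}) = [\mathrm{Re}(\omega_{r,n,k_1}), \mathrm{Im}(\omega_{r,n,k_1}), \dots, \mathrm{Re}(\omega_{r,n,k_u}), \mathrm{Im}(\omega_{r,n,k_u})]. \tag{2.6}$$

With those definitions,

$$I_{r,n,k} = f(\lambda_k)\,|\omega_{r,n,k}|^2 = f(\lambda_k)\,\|S_n(k)\|^2. \tag{2.7}$$

Since $X$ admits the linear representation (2.5), $S_n(\mathbf{k})$ can be further expressed as a
$2u$-dimensional infinite triangular array in the variables $(Z_t)_{t\in\mathbb{Z}}$. Precisely

$$S_n(\mathbf{k}) = \sum_{t\in\mathbb{Z}} U_{n,t}(\mathbf{k})\, Z_t \tag{2.8}$$

with

$$U_{n,t}(\mathbf{k}) \stackrel{\mathrm{def}}{=} (n a_r)^{-1/2}\, F_n^{-1}(\mathbf{k}) \sum_{s=1}^{n} \psi_{s-t}\, C_{n,s}(\mathbf{k}), \tag{2.9}$$

$$C_{n,t}(\mathbf{k}) = \sum_{p=0}^{r} (-1)^p \binom{r}{p} \bigl(\cos(t\lambda_{k_1+p}), \sin(t\lambda_{k_1+p}), \dots, \cos(t\lambda_{k_u+p}), \sin(t\lambda_{k_u+p})\bigr)$$

and $F_n(\mathbf{k}) \stackrel{\mathrm{def}}{=} \mathrm{diag}\bigl(|\psi(\lambda_{k_1})|, |\psi(\lambda_{k_1})|, \dots, |\psi(\lambda_{k_u})|, |\psi(\lambda_{k_u})|\bigr)$.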

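To formulate our results, some notations related to Edgeworth expansions are required, which
we take from the monograph of Bhattacharya and Rao [1976]. For $u$ a positive integer,
$\nu = (\nu_1, \dots, \nu_u) \in \mathbb{N}^u$ and $z = (z_1, \dots, z_u) \in \mathbb{C}^u$, denote
$|\nu| = \sum_{i=1}^{u} \nu_i$, $\nu! = \nu_1!\,\nu_2! \cdots \nu_u!$ and
$z^{\nu} = z_1^{\nu_1} z_2^{\nu_2} \cdots z_u^{\nu_u}$. If $1 \le |\nu| \le s$, denote
$\chi_{n,\nu}(\mathbf{k})$ the cumulants of $S_n(\mathbf{k})$. Then
$\chi_{n,\nu}(\mathbf{k}) = \kappa_{|\nu|} \sum_{j\in\mathbb{Z}} U_{n,j}^{\nu}(\mathbf{k})$ where $\kappa_r$ denotes
the $r$-th cumulant of $Z_1$, $r \le s$. Let
$V_n(\mathbf{k}) = \mathrm{cov}[S_n(\mathbf{k})] = \sum_{j\in\mathbb{Z}} U_{n,j}(\mathbf{k})\, U_{n,j}'(\mathbf{k})$. Let
$\chi = \{\chi_{\nu},\ \nu \in \mathbb{N}^u\}$ be a set of real numbers. For any integer $r \ge 2$ and
$z \in \mathbb{C}^u$, define $\chi_r(z) = r! \sum_{|\nu| = r} \chi_{\nu} z^{\nu}/\nu!$. The polynomials
$\tilde P_r(z, \chi)$ are formally defined for $r \ge 1$ by the identities

$$1 + \sum_{r=1}^{\infty} \tilde P_r(z, \chi)\, t^r = \exp\Bigl\{\sum_{r=3}^{\infty} \frac{\chi_r(z)}{r!}\, t^{r-2}\Bigr\} = 1 + \sum_{m=1}^{\infty} \frac{1}{m!} \Bigl(\sum_{r=3}^{\infty} \frac{\chi_r(z)}{r!}\, t^{r-2}\Bigr)^{m}$$

and we set $\tilde P_0 = 1$. Denote $\varphi_V$ the density of a Gaussian r.v. in $\mathbb{R}^u$ with zero
mean and non-singular covariance matrix $V$. Define $P_r : \mathbb{R}^u \to \mathbb{R}$ by
$P_r(x, V, \chi) = \tilde P_r(-D, \chi)\, \varphi_V(x)$ where, for any polynomial
$P(z) = \sum_{\nu} a_{\nu} z^{\nu}$, $P(-D)$ is interpreted as a polynomial in the differentiation
operator $D$, $P(-D) = \sum_{\nu} a_{\nu} (-1)^{|\nu|} D^{\nu}$ with
$D^{\nu} = \partial^{|\nu|} / (\partial x_1^{\nu_1} \cdots \partial x_u^{\nu_u})$, $\nu = (\nu_1, \dots, \nu_u) \in \mathbb{N}^u$.
By construction $\tilde P_r$ and $P_r$ do not depend on the coefficient $\chi_{\nu}$ if
$|\nu| > r + 2$, and $\tilde P_r(it, \chi)\, e^{-t'Vt/2}$ is the Fourier transform of $P_r(x, V, \chi)$.
Let $\xi_{\Gamma}$ be a centered $u$-dimensional Gaussian vector with covariance matrix $\Gamma$ and
$g : \mathbb{R}^u \to \mathbb{R}$ a measurable mapping. Define
$N_s(g) = \int_{\mathbb{R}^u} (1 + \|x\|^s)^{-1} |g(x)|\, dx$ and $\|g\|_{\Gamma}^2 = E[g^2(\xi_{\Gamma})]$. The
Hermite rank of $g$, $\|g\|_{\Gamma} < \infty$, with respect to $\Gamma$ is defined as the smallest integer
$r$ such that there exists a polynomial $P$ of degree $r$ with $E[g(\xi_{\Gamma}) P(\xi_{\Gamma})] \ne 0$.
We denote $\tau(g, \Gamma)$ the (positive) Hermite rank of $g - E[g(\xi_{\Gamma})]$ with respect to
$\Gamma$.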
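
To make the role of the first correction term $P_1$ concrete, the following Python sketch treats
a one-dimensional toy case chosen for this illustration only (it is not taken from the paper):
$S_n = n^{-1/2}\sum_{i\le n}(E_i - 1)$ with $E_i$ i.i.d. standard exponential, whose exact density
is a rescaled Gamma density. Here the third cumulant of $S_n$ is $\kappa_3 n^{-1/2}$ with
$\kappa_3 = 2$, and $P_1(x, 1, \chi) = (\chi_3/6)(x^3 - 3x)\varphi(x)$. All function names are
hypothetical.

```python
import numpy as np
from math import lgamma


def exact_density(x, n):
    """Density of S_n = n^{-1/2} * sum_{i<=n} (E_i - 1), with E_i i.i.d. Exp(1)."""
    y = n + np.sqrt(n) * x                   # the sum of the E_i follows Gamma(n, 1)
    out = np.zeros_like(x)
    pos = y > 0
    log_pdf = (n - 1) * np.log(y[pos]) - y[pos] - lgamma(n)
    out[pos] = np.sqrt(n) * np.exp(log_pdf)
    return out


def gaussian_density(x):
    """Standard Gaussian density (the r = 0 term of the expansion)."""
    return np.exp(-0.5 * x ** 2) / np.sqrt(2.0 * np.pi)


def edgeworth_density(x, n, kappa3=2.0):
    """Gaussian density plus the first correction term P_1."""
    chi3 = kappa3 / np.sqrt(n)               # third cumulant of S_n
    he3 = x ** 3 - 3.0 * x                   # (-D)^3 phi = He_3 * phi
    return gaussian_density(x) * (1.0 + chi3 * he3 / 6.0)


x = np.linspace(-5.0, 5.0, 2001)
n = 50
err_gauss = np.max(np.abs(exact_density(x, n) - gaussian_density(x)))
err_edge = np.max(np.abs(exact_density(x, n) - edgeworth_density(x, n)))
print(err_gauss, err_edge)
```

On this grid the sup-norm error of the one-term expansion is markedly smaller than that of the
plain Gaussian approximation, in line with the $n^{-1/2}$ versus $n^{-1}$ orders of the
corresponding remainders.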

3. Moment bounds: short memory case 

In this section we consider short-range dependent processes. For any reals $\alpha, \delta > 0$
and $\beta < \infty$, denote by $\mathcal{G}(\alpha, \beta, \delta)$ the set of real sequences
$(\psi_j)_{j\in\mathbb{Z}}$ such that

$$|\psi_0| + \sum_{j\in\mathbb{Z}} |j|^{1/2+\delta}\, |\psi_j| \le \beta, \tag{3.1}$$

$$\alpha \le \inf_{\lambda\in[-\pi,\pi]} |\psi(\lambda)|. \tag{3.2}$$

Theorem 1. Assume (A1) with some integer $s \ge 3$, $p \ge 1$ and $p' = 0$ and assume that
$(\psi_j)_{j\in\mathbb{Z}} \in \mathcal{G}(\alpha, \beta, \delta)$ for some $\alpha, \delta > 0$ and $\beta < \infty$.
Then, there exist constants $C$ and $N$ (depending only on $s, p, \alpha, \beta, \delta, u$ and the
distribution of $Z_0$) such that, for all $n \ge N$, and all $u$-tuples $\mathbf{k}$ of distinct
integers, the distribution of $S_n(\mathbf{k})$ has a density $q_{n,\mathbf{k}}$ with respect to Lebesgue
measure on $\mathbb{R}^{2u}$ and

$$\sup_{x\in\mathbb{R}^{2u}} (1 + \|x\|^s) \Bigl| q_{n,\mathbf{k}}(x) - \sum_{r=0}^{s-3} P_r\bigl(x, V_n(\mathbf{k}), \{\chi_{n,\nu}(\mathbf{k})\}\bigr) \Bigr| \le C n^{-(s-2)/2}. \tag{3.3}$$

Several interesting consequences can be derived from this result. A straightforward
integration of the expansion (3.3) yields the following corollary, which gives an Edgeworth
expansion of some moment $E[g(S_n(\mathbf{k}))]$ around the centered Gaussian distribution with
covariance matrix $V_n(\mathbf{k})$.

Corollary 2. There exist a constant $C$ and an integer $N$ (depending only on
$s, p, \alpha, \beta, \delta, u$ and the distribution of $Z_0$) such that, for any $u$-tuple of distinct
integers $\mathbf{k}$, $n \ge N$ and measurable function $g$ satisfying $N_s(g) < \infty$,

$$\Bigl| E[g(S_n(\mathbf{k}))] - \sum_{r=0}^{s-3} \int_{\mathbb{R}^{2u}} g(x)\, P_r\bigl(x, V_n(\mathbf{k}), \{\chi_{n,\nu}(\mathbf{k})\}\bigr)\, dx \Bigr| \le C\, N_s(g)\, n^{-(s-2)/2}. \tag{3.4}$$

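One can also use Theorem 1 to develop the same moment around the limiting Gaussian
distribution of $S_n$. Recalling that
$\omega_{r,n,k} = a_r^{-1/2} \sum_{s=0}^{r} \binom{r}{s} (-1)^s\, \omega_{0,n,k+s}$, we have

$$\lim_{n\to\infty} V_n(\mathbf{k}) = V(\mathbf{k})$$

under short memory conditions, where $V(\mathbf{k})$ is the $2u \times 2u$ matrix defined component-wise
by

$$[V(\mathbf{k})]_{2i-1,2j-1} = [V(\mathbf{k})]_{2i,2j} = \tfrac12\, \varsigma_r(k_i - k_j), \qquad [V(\mathbf{k})]_{2i-1,2j} = [V(\mathbf{k})]_{2i,2j-1} = 0, \tag{3.5}$$

for $i, j = 1, \dots, u$, with

$$\varsigma_r(k) = (-1)^k \binom{2r}{r+k} \Big/ \binom{2r}{r} \quad \text{for } |k| \le r, \qquad \varsigma_r(k) = 0 \quad \text{otherwise}. \tag{3.6}$$

Note that $V(\mathbf{k}) = \tfrac12 I_{2u}$ if $r = 0$.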

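Corollary 3. There exist a constant $C$ and an integer $N$ (depending only on
$s, p, \alpha, \beta, \delta, u$ and the distribution of $Z_0$), such that for all measurable functions
$g$ on $\mathbb{R}^{2u}$ such that $N_3(g) < \infty$, all $u$-tuples of distinct integers $\mathbf{k}$, and any
$n \ge N$,

$$\Bigl| E[g(S_n(\mathbf{k}))] - \int_{\mathbb{R}^{2u}} g(x)\, \varphi_{V(\mathbf{k})}(x)\, dx \Bigr| \le C \Bigl\{ n^{-1/2} N_3(g) + n^{-\tau(g, V(\mathbf{k}))/2}\, \|g\|_{V(\mathbf{k})} \Bigr\}. \tag{3.7}$$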

For some functions $g$, it is possible to sharpen this result by considering higher-order
($s > 3$) expansions and approximating the terms appearing in these expansions. We shall
consider mappings $g : \mathbb{R}^{2u} \to \mathbb{R}$ such that

$$g(x_1, \dots, x_{2u}) = \prod_{j=1}^{u} g_j(x_{2j-1}, x_{2j}) \tag{3.8}$$

with $g_j(x, y) = g_j(y, x) = g_j(-x, y)$, $j = 1, \dots, u$.

Recalling (2.7), products of functionals of the periodogram are included in this particular
case. Better bounds are obtained by considering frequencies $k_1, \dots, k_u$ separated by $r$, so
that the asymptotic decorrelation is achieved, $V(\mathbf{k}) = \tfrac12 I_{2u}$ as in the $r = 0$ case.
Under those conditions, the $O(n^{-1/2})$ of Corollary 3 can be improved to $O(n^{-1})$.

Corollary 4. Under the hypothesis that $s \ge 4$, there exist a constant $C$ and an integer $N$
(depending only on $s, p, \alpha, \beta, \delta, u$ and the distribution of $Z_0$), such that for all
measurable functions $g$ satisfying (3.8) and such that $N_s(g) < \infty$, all $u$-tuples of ordered
integers $\mathbf{k}$ such that $k_i < k_{i+1} - r$, and any $n \ge N$,

Eb(S„(k)]- / <7(x)¥.i,./2(x)dx <c{n-(-2)/2^.(9) + n-i(l + ||x|n<7(x)||i, 1. 

(3.9) 

The proofs of Corollaries 3 and 4 are postponed to Appendix E.

Remark. Pushing to higher orders $s > 4$ in Corollary 4 is sometimes necessary to have
$N_s(g) < \infty$ (see the applications below). But it does not improve the $O(n^{-1})$ bound.

To illustrate the results above, we compute bounds for the mean-square error of plug-in
estimators of non-linear functionals of the spectral density
$\Lambda(f) = \int_0^{\pi} w(\lambda)\, G(f(\lambda))\, d\lambda$ where $w$ is a function of bounded variation
and $G$ is a function such that there exists a function $H$ satisfying, for any $x > 0$,
$\int_0^{\infty} |H(xv)|\, e^{-v}\, dv < \infty$ and $\int_0^{\infty} H(xv)\, e^{-v}\, dv = G(x)$, i.e. $H$ is the
inverse Laplace transform of the function $t \mapsto G(1/t)/t$. We consider the following
estimator

$$\hat\Lambda_n = (\pi/\tilde n) \sum_{k=1}^{\tilde n} w(\lambda_k)\, H(I_{n,k})$$

and put $\Lambda_n = (\pi/\tilde n) \sum_{k=1}^{\tilde n} w(\lambda_k)\, G(f(\lambda_k))$. Here, $r = 0$ and
$I_{n,k} = I_{0,n,k}$ is the ordinary periodogram. We assume that the approximation error
$\Lambda_n - \Lambda$ is negligible in comparison with the mean-square error
$E(\hat\Lambda_n - \Lambda_n)^2$. These functionals have been studied in Taniguchi [1980] in the Gaussian
case and in Janas and von Sachs [1995] for non-Gaussian linear processes, under rather stringent
assumptions [see also Deo and Chen, 2000, and the references therein].

The moment bounds we have established allow us to extend the result of Janas and von Sachs
[1995] by relaxing the conditions on the dependence (from $|\psi_j| \le C\rho^{|j|}$ for some
$\rho \in (0, 1)$ to $\sum_{j\in\mathbb{Z}} |j|^{1/2} |\psi_j| < \infty$).
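
The relation between $H$ and $G$ can be checked numerically on a standard example. The Python
sketch below takes $G = \log$, for which $H(y) = \log y + \gamma$ with $\gamma$ the Euler constant
(since $E[\log V] = -\gamma$ for a standard exponential $V$), and evaluates the plug-in estimator
on a Gaussian white noise. The integration range $[0, \pi]$, the weight $w \equiv 1$, the sample
sizes and all function names are assumptions made for this illustration.

```python
import numpy as np

EULER_GAMMA = 0.5772156649015329


def H_log(y):
    """H such that integral_0^inf H(x v) e^{-v} dv = log(x), i.e. G = log."""
    return np.log(y) + EULER_GAMMA


def plug_in_estimate(x, w, H):
    """(pi / n_tilde) * sum_k w(lambda_k) H(I_{n,k}), ordinary periodogram (r = 0)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    n_tilde = (n - 1) // 2
    periodogram = np.abs(np.fft.fft(x)) ** 2 / (2.0 * np.pi * n)
    k = np.arange(1, n_tilde + 1)
    lam = 2.0 * np.pi * k / n
    return np.pi / n_tilde * np.sum(w(lam) * H(periodogram[k]))


rng = np.random.default_rng(0)

# Monte Carlo check of the Laplace-type identity at x = 2, with v ~ Exp(1).
v = rng.exponential(size=400_000)
laplace_check = np.mean(H_log(2.0 * v))      # should be close to log(2)

# Gaussian white noise: f = 1/(2 pi), so Lambda(f) = pi * log(1/(2 pi)) when w = 1.
noise = rng.standard_normal(4096)
estimate = plug_in_estimate(noise, lambda lam: np.ones_like(lam), H_log)
target = np.pi * np.log(1.0 / (2.0 * np.pi))
print(laplace_check, np.log(2.0), estimate, target)
```

The correction by $\gamma$ is what removes the bias of the raw log-periodogram: without it, the
average of $\log I_{n,k}$ would estimate $\log f(\lambda_k) - \gamma$ rather than $\log f(\lambda_k)$.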

Proposition 5. Let $(X_t)_{t\in\mathbb{Z}}$ be a sequence satisfying the assumptions of Theorem 1
with some $s \ge 4$. Put $H_1(x_1, x_2) = H(x_1^2 + x_2^2)$,
$H_2(x_1, x_2, x_3, x_4) = H_1(x_1, x_2)\, H_1(x_3, x_4)$ and assume that $N_3(H_1) < \infty$ and
$N_s(H_2) < \infty$. Then, uniformly in $\psi \in \mathcal{G}(\alpha, \beta, \delta)$,

$$E[(\hat\Lambda_n - \Lambda_n)^2] \le C n^{-1}.$$

Sketch of the proof. Applying Corollary 3 to the function
$g_{k,f}(x_1, x_2) = H[f(\lambda_k)(x_1^2 + x_2^2)]$ and Corollary 4 to
$g_{k,j,f}(x_1, x_2, x_3, x_4) = H[f(\lambda_k)(x_1^2 + x_2^2)]\, H[f(\lambda_j)(x_3^2 + x_4^2)]$ yields
asymptotic expansions for the moments $E[H(I_{n,k})]$ and $E[H(I_{n,k}) H(I_{n,j})]$, which are
sufficient to derive the result. The uniformity of the constant $C$ follows from the existence
of bounds on $N_3(g_{k,f})$ and $N_s(g_{k,j,f})$ which are uniform in
$\psi \in \mathcal{G}(\alpha, \beta, \delta)$. □

4. Moment bounds : Long memory case 

4.1. Assumptions and main results. We consider two sets of assumptions, depending on the
available information on the behavior of the spectral density outside a neighborhood of the
zero frequency. Recall that a real valued function $\phi$ defined in a neighborhood of zero is
regularly varying at zero with index $\rho \in \mathbb{R}$ if, for all $t > 0$,
$\lim_{x\to 0} \phi(tx)/\phi(x) = t^{\rho}$. If $\rho = 0$, the function $\phi$ is said to be slowly
varying at zero. Let $\vartheta \in (0, \pi)$, $0 < \delta < 1/2$, $\Delta < \delta$. We say that the linear
filter $(\psi_j)_{j\in\mathbb{Z}}$ belongs to the set $\mathcal{F}(\vartheta, \delta, \Delta, \mu)$ if
$\sum_{j\in\mathbb{Z}} \psi_j^2 < \infty$ and if there exists $d \in [\Delta, \delta]$ such that $\psi(\lambda)$ is
regularly varying at zero with index $-d$ and that

/;A2'^|^(A)|2dA 



mino<|A|<,?A2'i|V'(A)|^ 



< ^ , (4.1) 



mmo<A<tf A'^|V'(A)| 

An example is provided by $\psi(\lambda) = (1 - e^{i\lambda})^{-d}$, the transfer function of the causal
fractional integration filter, $\psi_t = \Gamma(t + d)/(\Gamma(d)\,\Gamma(t + 1))$, $t \ge 0$.
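
The coefficients of the fractional integration filter are easy to generate numerically. The
Python sketch below (function names are hypothetical) uses the one-step recursion
$\psi_t = \psi_{t-1}(t - 1 + d)/t$, which follows from the Gamma-function formula, compares it
with that formula for $d > 0$, and illustrates the hyperbolic decay
$\psi_t \sim t^{d-1}/\Gamma(d)$ that is responsible for long memory.

```python
from math import exp, gamma, lgamma


def frac_integration_coeffs(d, size):
    """psi_0..psi_{size-1} of (1 - B)^{-d}, via psi_t = psi_{t-1} * (t - 1 + d) / t."""
    psi = [1.0]
    for t in range(1, size):
        psi.append(psi[-1] * (t - 1 + d) / t)
    return psi


def gamma_formula(d, t):
    """psi_t = Gamma(t + d) / (Gamma(d) Gamma(t + 1)), on the log scale (d > 0)."""
    return exp(lgamma(t + d) - lgamma(d) - lgamma(t + 1))


d = 0.3
psi = frac_integration_coeffs(d, 5001)
# Hyperbolic decay psi_t ~ t^{d-1} / Gamma(d): this ratio tends to 1 as t grows.
decay_ratio = psi[5000] * gamma(d) * 5000 ** (1.0 - d)
print(psi[:4], decay_ratio)
```

The recursion avoids evaluating large Gamma values directly and is the form typically used when
a truncated moving-average representation of the process is simulated.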

Local-to-zero assumptions. We first consider local-to-zero assumptions for which nothing is
required outside a neighborhood of the zero frequency, apart from integrability of the spectral
density (see Robinson [1995b]). For $\beta > 0$, we say that the sequence
$(\psi_j)_{j\in\mathbb{Z}}$ belongs to the set $\mathcal{F}_{\mathrm{local}}(\vartheta, \beta, \delta, \Delta, \mu)$ if
$(\psi_j)_{j\in\mathbb{Z}} \in \mathcal{F}(\vartheta, \delta, \Delta, \mu)$ and

mmAg(o,^] |V'*(A)| 
with $\psi^*(\lambda) = (1 - e^{i\lambda})^{d}\, \psi(\lambda)$ where $d$ is the index of regular variation of
$\psi$. This class is quite general and includes the impulse response of FARIMA filters [see
Doukhan, Oppenheim, and Taqqu, 2003, and the references therein] but also processes whose
spectral density may exhibit singularities outside the zero frequency, such as the Gegenbauer
processes. As seen below, under local-to-zero assumptions, the validity of the Edgeworth
expansion can only be established for the DFT coefficients in a degenerating neighborhood of
the zero frequency. This is enough for, say, semi-parametric estimation of the long-memory
index by the GPH method.

Global assumptions. In some situations, it is possible to formulate regularity assumptions
over the full frequency range $[-\pi, \pi]$ or a subset of it. These assumptions allow one to prove
the validity of the Edgeworth expansion for all the frequency ordinates. We say that the
sequence $(\psi_j)$ belongs to the set $\mathcal{F}_{\mathrm{global}}(\vartheta, \beta, \delta, \Delta, \mu)$ if
$(\psi_j) \in \mathcal{F}_{\mathrm{local}}(\vartheta, \beta, \delta, \Delta, \mu)$ and if in addition, for all
$(\lambda, \lambda') \in (0, \vartheta] \times (0, \vartheta]$,
|,.(„_,.(y),,,m™M|,_y| ,4.4) 

Under those assumptions and as in the short-memory case, we are able to prove the validity
of the Edgeworth expansion for the DFTs (Theorem 6) and deduce some moment bounds
(Corollaries 7, 9 and 10). In comparison with the short memory results, note that tapering
($r > 0$) and (A1) with $p' \ge s$ are required.

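Theorem 6. Assume (A1) with some integer $s \ge 3$, $p \ge 1$ and $p' \ge s$. Let $r$ be a
positive integer and $\beta, \delta, \Delta, \mu, \vartheta$ be constants such that $0 < \delta < 1/2$,
$-r + 1/2 < \Delta < 0$, $\mu > 0$ and $\vartheta \in (0, \pi]$. Let $(m_n)_{n\ge 0}$ be a non-decreasing
sequence. Assume either

$$(\psi_j)_{j\in\mathbb{Z}} \in \mathcal{F}_{\mathrm{local}}(\vartheta, \beta, \delta, \Delta, \mu) \quad \text{and} \quad \lim_{n\to\infty} \Bigl( \frac{1}{m_n} + \frac{m_n}{n} \Bigr) = 0 \tag{4.5}$$

or

$$(\psi_j)_{j\in\mathbb{Z}} \in \mathcal{F}_{\mathrm{global}}(\vartheta, \beta, \delta, \Delta, \mu) \quad \text{and} \quad m_n \le \vartheta n. \tag{4.6}$$

Then there exist a constant $C$ and positive integers $K_0$, $N_0$ which depend only on
$\vartheta, \beta, \delta, \Delta, \mu$, the distribution of $Z_1$ and the sequence $(m_n)$, such that for any
$n \ge N_0$ and $\mathbf{k} = (k_1, \dots, k_u)$ of integers in the range $\{K_0, \dots, m_n\}$, the
distribution of $S_n(\mathbf{k})$ has a density $q_{n,\mathbf{k}}$ with respect to Lebesgue measure on
$\mathbb{R}^{2u}$ which satisfies

$$\sup_{x\in\mathbb{R}^{2u}} (1 + \|x\|^s) \Bigl| q_{n,\mathbf{k}}(x) - \sum_{r=0}^{s-3} P_r\bigl(x, V_n(\mathbf{k}), \{\chi_{n,\nu}(\mathbf{k})\}\bigr) \Bigr| \le C n^{-(s-2)/2}. \tag{4.7}$$

If $u = 1$, one can take $K_0 = 1$.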

Integrating some function $g$ against the density $q_{n,\mathbf{k}}$ and using (4.7) yields the
following corollary.

Corollary 7. Under the assumptions of Theorem 6, there exist a constant $C$ and an integer $N$
depending only on $\vartheta, \beta, \delta, \Delta, \mu, u, r$ and such that, for all $u$-tuples of distinct
integers $\mathbf{k}$ satisfying $K_0 \le \min(\mathbf{k})$, $\max(\mathbf{k}) \le m_n$, any $n \ge N$, and all
measurable functions $g$ such that $N_s(g) < \infty$,

$$\Bigl| E[g(S_n(\mathbf{k}))] - \sum_{r=0}^{s-3} \int_{\mathbb{R}^{2u}} g(x)\, P_r\bigl(x, V_n(\mathbf{k}), \{\chi_{n,\nu}(\mathbf{k})\}\bigr)\, dx \Bigr| \le C\, N_s(g)\, n^{-(s-2)/2}. \tag{4.8}$$



Similarly to the short-memory case, one could approximate $E[g(S_n(\mathbf{k}))]$ using the
limiting distribution of $S_n(\mathbf{k})$ in place of the Gaussian approximation of Corollary 7.
Under long-range dependence and for fixed $\mathbf{k}$, the limiting covariance matrix of
$S_n(\mathbf{k})$ fully depends on $\mathbf{k}$ and not only on $(k_2 - k_1, \dots, k_u - k_{u-1})$. This
behavior at "very low frequencies" has been studied for instance by Hurvich and Beltrao [1993].
However, one can control the covariance of the standardized DFT coefficients and then the
difference $V_n(\mathbf{k}) - V(\mathbf{k})$ thanks to the following lemma.

Lemma 8. For $1 \le k \le j \le \vartheta n/\pi - r$ and $r \ge 1$, there exists a constant $C$ depending
only on $\vartheta, \beta, \delta, \Delta, \mu$ such that

$$|E(\omega_{r,n,k}\, \omega_{r,n,j})| + |E(\omega_{r,n,k}\, \bar\omega_{r,n,j}) - \varsigma_r(k - j)| \le C \rho(k, j, n, \beta) \tag{4.9}$$

with



p{k,j,n,P) 



{jk)-y^ + 

{jkyy^ 



jVk 



/3 



under 
under 



B 

G 



(4.10) 



Thus, we can develop the moments around the Gaussian distribution with covariance matrix
$V(\mathbf{k})$ as in the short-memory context. The two following corollaries prove sufficient for
our applications. The next corollary is useful for moment bounds on one frequency $k$.

Corollary 9. Under the assumptions of Theorem 6, there exist a constant $C$ and a positive
integer $N_0$ which depend only on $\vartheta, \beta, \delta, \Delta, \mu$, the distribution of $Z_1$ and the
sequence $(m_n)$, such that for any $n \ge N_0$, for any integer $k$ in the range
$\{1, \dots, m_n\}$ and any measurable function $g$ on $\mathbb{R}^2$ such that $N_3(g) < \infty$,

$$\Bigl| E[g(S_n(k))] - \int_{\mathbb{R}^2} g(x)\, \varphi_{I_2/2}(x)\, dx \Bigr| \le C \Bigl\{ n^{-1/2} N_3(g) + \rho(k, k, n, \beta)^{\tau(g, V(k))/2}\, \|g\|_{I_2/2} \Bigr\}.$$

The next corollary is useful for moment bounds on two frequencies $k < j - r$.

Corollary 10. Under the assumptions of Theorem 6, there exist a constant $C$ and positive
integers $K_1 \ge K_0$, $N_0$ which depend only on $\vartheta, \beta, \delta, \Delta, \mu$, the distribution of
$Z_1$ and the sequence $(m_n)$, such that for any $n \ge N_0$ and for any couple $\mathbf{k} = (k, j)$
of integers in the range $\{K_0, \dots, m_n\}$ such that $k < j - r$ and any measurable function $g$
on $\mathbb{R}^4$ verifying (3.8) and such that $N_s(g) < \infty$,

E[5(S„(k))]- / 5(x)(^i,/2(x)dx 

<c{n-(-2)/2^,(5) + n-i/V(fc,j,n,/3)||(l + ||x|n<7(x)||i,/2} . 
4.2. GPH estimation of the long memory parameter. 

4.2.1. Theoretical results. A very widely used estimator of the memory parameter $d$ was
introduced by Geweke and Porter-Hudak [1983]. It is obtained from the linear regression of the
log-periodogram of the observations using the logarithm of the frequencies as explanatory
variable. In contrast with the Whittle estimator, the GPH estimator is defined explicitly in
terms of the log-periodogram ordinates, see Eq. (4.11) below. Much theoretical work has been
done on this estimator, in stationary or non-stationary contexts (see Faÿ, Moulines, Roueff,
and Taqqu, 2008 for a survey of the main results). For instance, Giraitis et al. [1997] proved
that the GPH estimator for Gaussian $X$ is rate optimal for the quadratic risk over some classes
of spectral densities that are included in our $\mathcal{F}_{\mathrm{local}}$. To compute the risk of the
GPH estimator, one needs to compute or approximate moments of the log-periodogram. The
log-periodogram is a non-smooth function of the Fourier transforms of the observations, which
are Gaussian if $X$ is Gaussian. The proof by Giraitis et al. [1997] relies on moment bounds of
non-linear functions of Gaussian variables [see Arcones, 1994, Soulier, 2001]; this technique
does not extend naturally to non-Gaussian time series. Here, we shall apply the Edgeworth
approximations obtained in the preceding section to extend this result to the case of strong
sense linear processes.

For the sake of simplicity of exposition, we only consider a taper of order $r = 1$ and write
$I_k = I_{1,n,k}$. The GPH estimator is obtained by an ordinary least squares regression of
$\log(I_k)$ on $\log|2\sin(\lambda_k/2)|$ [see Geweke and Porter-Hudak, 1983, Robinson, 1995b]. With
a taper of order $r$, one regresses on every $(r+1)$-th frequency. For $r = 1$ it reads

$$(\hat d_m, \hat C) = \arg\min_{d', C'} \sum_{k=1}^{m} \bigl\{ \log(I_{2k+1}) + 2d' \log|2\sin(\lambda_{2k+1}/2)| - C' \bigr\}^2,$$
where $m = m(n)$ is a bandwidth parameter. Explicitly

$$\hat d_m = s_m^{-2} \sum_{k=1}^{m} \nu_k \log(I_{2k+1}), \tag{4.11}$$

with $\nu_k = -2\bigl(\log|2\sin(\lambda_{2k+1}/2)| - m^{-1} \sum_{j=1}^{m} \log|2\sin(\lambda_{2j+1}/2)|\bigr)$ and
$s_m^2 = \sum_{k=1}^{m} \nu_k^2$. We consider $E[(\hat d_m - d)^2]$, the mean square error (MSE) of
the GPH estimator. Theorem 11 gives a bound on the MSE which is uniform over a class of
long-range dependent linear processes, from which rate optimality can be deduced.
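
The estimator (4.11) takes only a few lines to implement. The Python sketch below is one
possible implementation under the conventions of this section (taper of order $r = 1$, regression
on the frequencies $\lambda_{2k+1}$, $k = 1, \dots, m$); the function names and the use of an FFT to
evaluate the tapered DFT are choices made for this illustration.

```python
import numpy as np


def tapered_periodogram(x, r=1):
    """I_{r,n,k} for k = 0..n-1, with the taper h_{t,n}^r = (1 - e^{2 i pi t/n})^r."""
    x = np.asarray(x, dtype=float)
    n = x.size
    t = np.arange(1, n + 1)
    h = (1.0 - np.exp(2j * np.pi * t / n)) ** r
    a_r = np.mean(np.abs(h) ** 2)
    # |sum_t y_t e^{i t lambda_k}| = |n * ifft(y)[k]|: the extra phase has modulus one.
    dft_modulus = np.abs(n * np.fft.ifft(h * x))
    return dft_modulus ** 2 / (2.0 * np.pi * n * a_r)


def gph_from_periodogram(I, n, m):
    """Estimator (4.11): OLS slope of log I_{2k+1} on -2 log|2 sin(lambda_{2k+1}/2)|."""
    k = 2 * np.arange(1, m + 1) + 1          # indices 2k+1, k = 1..m
    lam = 2.0 * np.pi * k / n
    regressor = -2.0 * np.log(np.abs(2.0 * np.sin(lam / 2.0)))
    nu = regressor - regressor.mean()        # centered regressors nu_k
    return np.sum(nu * np.log(I[k])) / np.sum(nu ** 2)


def gph(x, m):
    """GPH estimate of d from the observations x with bandwidth m (requires 2m+1 < n)."""
    x = np.asarray(x, dtype=float)
    return gph_from_periodogram(tapered_periodogram(x, r=1), x.size, m)


rng = np.random.default_rng(1)
print(gph(rng.standard_normal(4096), 200))   # white noise: the true value is d = 0
```

Because the regressors $\nu_k$ are centered, the estimate is unchanged when the periodogram is
multiplied by a constant, so the scale of the innovations plays no role.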



Theorem 11. Under the assumptions of Theorem 6 with $s \ge 5$ and conditions (4.5), there
exists a constant $C$ which depends only on $\beta, \delta, \Delta, \vartheta, \mu$ and the distribution of
$Z_1$ such that

$$E[(\hat d_m - d)^2] \le C \Bigl\{ \Bigl(\frac{m}{n}\Bigr)^{2\beta} + \frac{1}{m} \Bigr\}.$$

With $m$ proportional to $n^{2\beta/(2\beta+1)}$, $E[(\hat d_m - d)^2] \le C n^{-2\beta/(2\beta+1)}$.

Remark. The condition $s \ge 5$ seems slightly stronger than necessary for bounding the MSE of
$\hat d$. But it allows the function $h(x_1, \dots, x_4) = g(x_1, x_2)\, g(x_3, x_4)$ with
$g(x) = \log(\|x\|^2) - \eta$ to have finite $N_s(h)$ norm (see Corollary 4 and the remark that
follows).

4.2.2. Monte Carlo results. Assuming a more stringent global condition on the regularity of
the spectral density allows one to evaluate the bias term in the decomposition of the mean
squared error. For comparison, using the specific set of assumptions of Hurvich, Deo, and
Brodsky [1998], we can prove that the leading terms in the MSE are of the form
$a m^4/n^4 + b/m$ for bandwidths such that $\lim_{n\to\infty} 1/m + m\log(m)/n = 0$. The constants
$a$ and $b$ can be made uniform in the class of spectral densities under consideration. It shows
that the MSE of the GPH estimator is asymptotically insensitive to the distribution of the
innovation as soon as this distribution satisfies some moment and regularity conditions.
Finite sample implications of this statement are illustrated here by the results of a Monte
Carlo study. For sample sizes $n = 250, 500, 1000, 2500, 5000$, we have simulated 1000
realizations of FARIMA(1, d, 0) processes defined by

$$(1 - B)^{d} (1 - 0.3B)\, X_t = Z_t$$

where $B$ is the back-shift operator and $(Z_t)_{t\in\mathbb{Z}}$ is a zero mean unit variance i.i.d.
sequence with the following marginal distributions: (a) Gaussian, (b) Laplacian, (c) zero-mean
(shifted) Pareto, with

P(Zo <u) = {l-{u + 7/6)-^)l„>_7/6.

Whereas it is possible to simulate exactly a Gaussian FARIMA(p, d, q) process (e.g. by
computing the covariance structure and using the Levinson-Durbin algorithm), there is no
general way to do it for non-Gaussian processes. In the Monte Carlo experiment, the process
$(X_t)$ is obtained using a truncated MA($\infty$) representation. For each realization of each
process, we evaluate the squared error $(\hat d_m - d)^2$ and define the Monte Carlo MSE as the
average of those errors. We have focused on the sensitivity, with respect to the distribution
of $Z$, of the bandwidth $m$ which is optimal in the MSE sense. Figure 1 and Table 1 show that
for sample size $n = 250$ the MSE is minimized at $m = 37$ or $38$, which means that the optimal
bandwidth is about the same for those three linear processes. Figure 2 represents the
box-and-whiskers plot of the GPH estimator for two different sample sizes and the three models
we are concerned with. Here again, the sensitivity with respect to the distribution of the
driving noise is hardly discernible. In Table 1 we display the value of the bias and of the
mean square error of the GPH estimator at this estimated optimal bandwidth.
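
The simulation scheme described above can be sketched as follows in Python. This is only an
illustration of a truncated MA($\infty$) simulation, not the code used for the study: the value
$d = 0.3$, the truncation lag, the seed and the function names are assumptions, and only the
Gaussian and Laplace innovations are implemented.

```python
import numpy as np


def farima_1d0_ma_coeffs(d, phi, size):
    """MA(infinity) weights of (1 - B)^{-d} (1 - phi B)^{-1}, truncated to `size` terms."""
    frac = np.ones(size)
    for t in range(1, size):
        frac[t] = frac[t - 1] * (t - 1 + d) / t      # weights of (1 - B)^{-d}
    ar = phi ** np.arange(size)                      # weights of (1 - phi B)^{-1}
    return np.convolve(frac, ar)[:size]


def draw_innovations(kind, size, rng):
    """Zero-mean, unit-variance innovations: 'gaussian' or 'laplace'."""
    if kind == "gaussian":
        return rng.standard_normal(size)
    if kind == "laplace":
        # A Laplace law with scale b has variance 2 b^2, hence b = 1 / sqrt(2).
        return rng.laplace(scale=1.0 / np.sqrt(2.0), size=size)
    raise ValueError(f"unknown innovation law: {kind}")


def simulate_farima(n, d, phi, kind, rng, trunc=2000):
    """X_t = sum_{j < trunc} psi_j Z_{t-j}, t = 1..n (truncated MA representation)."""
    psi = farima_1d0_ma_coeffs(d, phi, trunc)
    z = draw_innovations(kind, n + trunc - 1, rng)
    return np.convolve(z, psi, mode="valid")


rng = np.random.default_rng(0)
x_gauss = simulate_farima(250, 0.3, 0.3, "gaussian", rng)
x_laplace = simulate_farima(250, 0.3, 0.3, "laplace", rng)
print(x_gauss.shape, x_laplace.shape)
```

The truncation lag controls how much of the slowly decaying tail of the MA weights is kept;
since the weights decay hyperbolically, a short truncation attenuates the low-frequency
behavior that the GPH estimator is designed to capture.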




Figure 1. Comparisons of the MSE versus the bandwidth for the FARIMA
processes (a), (b) and (c). Sample size n = 250.

Figure 2. Box-plot of the GPH estimator for processes (a), (b) and (c), sample
size n = 250, 2500.



n       quantity                             (a)         (b)         (c)
250     m_opt                                 37          37          38
        E_MC(\hat d_{m_opt} - d)        -0.03037    -0.03751    -0.03871
        E_MC((\hat d_{m_opt} - d)^2)     0.01507     0.01508     0.01569
500     m_opt                                 64          60          61
        E_MC(\hat d_{m_opt} - d)        -0.02813    -0.01871    -0.02778
        E_MC((\hat d_{m_opt} - d)^2)     0.00807     0.00734     0.00766
1000    m_opt                                106         117         107
        E_MC(\hat d_{m_opt} - d)        -0.01984    -0.02377    -0.01737
        E_MC((\hat d_{m_opt} - d)^2)     0.00507     0.00393     0.00438
2500    m_opt                                222         212         238
        E_MC(\hat d_{m_opt} - d)        -0.01492    -0.00967    -0.02037
        E_MC((\hat d_{m_opt} - d)^2)     0.00207     0.00202     0.00219
5000    m_opt                                377         385         370
        E_MC(\hat d_{m_opt} - d)        -0.01097    -0.00937    -0.01269
        E_MC((\hat d_{m_opt} - d)^2)     0.00106     0.00104     0.00097

Table 1. Optimal bandwidth, bias and MSE for processes (a), (b) and (c)
and different sample sizes n. All those values are estimated by Monte Carlo.



Appendix A. Edgeworth expansion for triangular arrays

In this section we recall the theorem established in Faÿ, Moulines, and Soulier [2004]. Let
$(Z_t)_{t\in\mathbb{Z}}$ be an i.i.d. sequence and $(U_{n,j})_{j\in\mathbb{Z}, n\in\mathbb{N}}$ an array of vectors in
$\mathbb{R}^u$, where $u$ is an integer. Define $S_n = \sum_{j\in\mathbb{Z}} U_{n,j} Z_j$ and let
$V_n = \sum_{j\in\mathbb{Z}} U_{n,j} U_{n,j}'$. For $\nu \in \mathbb{N}^u$, $2 \le |\nu| \le s$, denote $\chi_{n,\nu}$
the cumulants of $S_n$. Then $\chi_{n,\nu} = \kappa_{|\nu|} \sum_{j\in\mathbb{Z}} U_{n,j}^{\nu}$ where $\kappa_r$
denotes the $r$-th cumulant of $Z_1$, $r \le s$. Consider the following assumptions.

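(B1) There exist positive constants $v_*$ and $v^*$ such that

$$v_* \le \liminf_{n} v_{\min}[V_n] \le \limsup_{n} v_{\max}[V_n] \le v^*$$

where $v_{\min}[V_n]$ (resp. $v_{\max}[V_n]$) is the smallest (resp. the largest) eigenvalue of
$V_n$.

(B2) There exist positive constants $\eta$, $c_0$, a sequence $(M_n)_{n\in\mathbb{N}}$ of positive numbers,
and a sequence $(J_n)_{n\in\mathbb{N}}$ of subsets of $\mathbb{Z}$, such that, for all $n \ge 0$,

$$\sup_{j\in\mathbb{Z}} \|U_{n,j}\| \le M_n, \tag{A.1}$$

$$\lim_{n\to\infty} M_n = 0, \tag{A.2}$$

$$\mathrm{card}(J_n) \le c_0 M_n^{-2} \quad \text{and} \quad \frac{\sum_{j\in J_n} \|U_{n,j}\|^2}{\sum_{j\in\mathbb{Z}} \|U_{n,j}\|^2} \ge \eta. \tag{A.3}$$

(B3) There exist $\zeta \ge 1$ and a sequence $(M_n)_{n\in\mathbb{N}}$ satisfying (A.1) such that

$$\sup_{n\ge 0} M_n^{\zeta} \sum_{j\in\mathbb{Z}} \|U_{n,j}\| < \infty.$$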

Theorem 12 (Faÿ, Moulines, and Soulier, 2004). Let $s \ge 3$ and $p' \ge 0$ be integers and
$p \ge 1$ be a real number. Assume (A1) with parameters $(s, p, p')$, (B1) and (B2). Assume in
addition either (B3) or $p' \ge s$ in (A1). Then, there exist a constant $C$ and an integer $N$
(depending only on the distribution of $Z_1$, and the constants appearing in the assumptions)
such that, for all $n \ge N$, the distribution of $S_n$ has a density $q_n$ with respect to
Lebesgue measure on $\mathbb{R}^u$ which satisfies

$$\sup_{x\in\mathbb{R}^u} (1 + \|x\|^s) \Bigl| q_n(x) - \sum_{r=0}^{s-3} P_r\bigl(x, V_n, \{\chi_{n,\nu}\}\bigr) \Bigr| \le C \sum_{j\in\mathbb{Z}} \|U_{n,j}\|^s. \tag{A.4}$$

Appendix B. Proof of Theorem 1

The proof consists in checking that assumptions (B1), (B2) and (B3) hold uniformly with respect to $\psi \in \Theta(\alpha,\beta,\delta)$ and $k$ for $U_{n,j}$'s of the form (2.9). To prove (B1), write $V_n(k) = V(k) + W_n(k)$, with $V(k)$ defined in (3.5). Define $\|W\|_1 = \max_{1\le i\le v} \sum_{j=1}^{v} |w_{i,j}|$ for any matrix $W = (w_{i,j})_{1\le i,j\le v}$. Similarly to [Hannan, 1960, p. 54], we have under (3.1)

$$\|W_n(k)\|_1 \le C(\alpha,\beta)\, n^{-1}. \qquad (B.1)$$

The matrices V(k) have the following algebraic property. 

Lemma 13. There exist two positive constants $v_*$ and $v^*$ such that

$$2v_* \le \inf v_{\min}[V(k)] \le \sup v_{\max}[V(k)] \le 2v^* \qquad (B.2)$$

where the infimum and supremum are taken over all the $u$-tuples of distinct integers in $\mathbb{N}^u$.

Proof. Noting that $\mathrm{trace}[V(k)] = u$,

$$v_{\max}[V(k)] \le \mathrm{trace}[V(k)]/2u = 1/2. \qquad (B.3)$$

Take $v^* = 1/4$. Recall that $k_1 < \cdots < k_u$. Note that, for any $n \ge 2k_u + 2r + 1$, $V(k)$ is the covariance matrix of

$$\left( c^Y_{r,n,k_1}, s^Y_{r,n,k_1}, \ldots, c^Y_{r,n,k_u}, s^Y_{r,n,k_u} \right)$$

with $c^Y_{r,n,k} = (2\pi a_r n)^{-1/2} \sum_{t=1}^{n} h^r_{t,n} Y_t \cos(t\lambda_k)$ and $s^Y_{r,n,k} = (2\pi a_r n)^{-1/2} \sum_{t=1}^{n} h^r_{t,n} Y_t \sin(t\lambda_k)$ the cosine and sine transforms of a unit-variance zero-mean Gaussian white noise $(Y_n)_{n\in\mathbb{Z}}$. Recall that

$$c^Y_{r,n,k} = a_r^{-1/2} \sum_{l=0}^{r} (-1)^l \binom{r}{l} c^Y_{0,n,k+l} \quad\text{and}\quad s^Y_{r,n,k} = a_r^{-1/2} \sum_{l=0}^{r} (-1)^l \binom{r}{l} s^Y_{0,n,k+l}. \qquad (B.4)$$

The random variables $c^Y_{0,n,k}$ and $s^Y_{0,n,k}$, $k = 1, \ldots, [(n-1)/2]$, are centered i.i.d. Gaussian with variance $1/4\pi$. Assume that $V(k)$ is not invertible. Then, for some $2u$-tuple of reals $(\alpha_1,\beta_1,\ldots,\alpha_u,\beta_u) \ne (0,0,\ldots,0,0)$,

$$\sum_{j=1}^{u} \left( \alpha_j c^Y_{r,n,k_j} + \beta_j s^Y_{r,n,k_j} \right) = 0.$$

Then by (B.4), there exists a linear combination of the $c^Y_{0,n,k}$'s and $s^Y_{0,n,k}$'s that is equal to zero. The variables $c^Y_{0,n,k_u+r}$ and $s^Y_{0,n,k_u+r}$ appear in this combination with coefficients $a_r^{-1/2}(-1)^r\alpha_u$ and $a_r^{-1/2}(-1)^r\beta_u$, respectively. It follows from the independence and non-degeneracy of the $c^Y_{0,n,k}$'s and $s^Y_{0,n,k}$'s that $\alpha_u = \beta_u = 0$. Iterating the argument yields the contradiction $\alpha_u = \beta_u = \alpha_{u-1} = \beta_{u-1} = \cdots = \alpha_1 = \beta_1 = 0$. Thus, for any $u$-tuple $k$ of distinct integers,

$$v_{\min}[V(k)] > 0. \qquad (B.5)$$

It remains to prove that $v_{\min}[V(k)]$ is bounded away from zero uniformly in $k$. Define

$$K_u = \{ k = (k_1, \ldots, k_{u'}) \in \mathbb{N}^{u'},\ 1 \le u' \le u,\ 0 < k_{i+1} - k_i \le r \}.$$

Note now that, by (3.5), $v_{\min}[V(k)]$ is a function of the vector $(k_2 - k_1, k_3 - k_2, \ldots, k_u - k_{u-1})$, and thus takes finitely many different values on $K_u$. From this remark and (B.5),

$$v_1 := \inf_{k\in K_u} v_{\min}[V(k)] > 0 \qquad (B.6)$$

since the infimum is taken on a finite set of positive values. Consider now a $u$-tuple $k$ that does not belong to $K_u$. In this case, for some $i \in \{1,\ldots,u-1\}$, $k_{i+1} - k_i > r$, and then $k$ may be partitioned into $L \ge 2$ blocks of indices $(k_1, \ldots, k_L)$ such that all the $k_i$'s belong to $K_u$ and, for all $i \in \{1,\ldots,L-1\}$, $\min k_{i+1} - \max k_i > r$. Let $l_i$ denote the length of the block $k_i$, $i = 1,\ldots,L$. By this construction and (3.5), the matrix $V(k)$ has a block-diagonal structure

$$V(k) = \mathrm{diag}\big( V(k_1), \ldots, V(k_L) \big).$$

Using (B.6),

$$v_{\min}[V(k)] \, (v_{\max}[V(k)])^{2u-1} \ge \det[V(k)] = \prod_{i=1}^{L} \det[V(k_i)] \ge \prod_{i=1}^{L} v_1^{2l_i} = v_1^{2u}. \qquad (B.7)$$

We conclude from (B.3) and (B.7) that

$$v_{\min}[V(k)] \ge v_1^{2u}\, 2^{2u-1} =: v_2 > 0. \qquad (B.8)$$

(B.2) follows from (B.6) and (B.8) with $v_* = \frac{1}{2}\min(v_1, v_2)$. □
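The determinant step (B.7) combines two elementary facts: the determinant of a block-diagonal matrix is the product of the block determinants, and a determinant is at most $v_{\min} v_{\max}^{\dim - 1}$. The following sketch checks both on a small example; the $2\times 2$ blocks are hypothetical numbers chosen only for illustration, not covariance matrices computed from (3.5).

```python
import math


def eig_sym_2x2(a, b, c):
    """Eigenvalues (small, large) of the symmetric matrix [[a, b], [b, c]]."""
    mean = 0.5 * (a + c)
    radius = math.hypot(0.5 * (a - c), b)
    return mean - radius, mean + radius


# Hypothetical diagonal blocks, each stored as (a, b, c) for [[a, b], [b, c]].
blocks = [(0.5, 0.1, 0.5), (0.5, -0.2, 0.5), (0.5, 0.05, 0.5)]

eigenvalues = []
det_v = 1.0
for a, b, c in blocks:
    small, large = eig_sym_2x2(a, b, c)
    eigenvalues.extend([small, large])
    det_v *= a * c - b * b   # determinant of the full matrix = product of block determinants

v_min, v_max = min(eigenvalues), max(eigenvalues)
dim = len(eigenvalues)       # plays the role of 2u

# Rearranged first inequality of (B.7): v_min >= det / v_max^(dim - 1).
lower_bound = det_v / v_max ** (dim - 1)

print(v_min, lower_bound)
```

The spectrum of the block-diagonal matrix is the union of the block spectra, which is why the eigenvalues can be collected block by block.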

Proof of Theorem 1. By (B.1) and Lemma 13, (B1) holds with the $v_*$ and $v^*$ of Lemma 13 for some $N_0$, $n \ge N_0$, and uniformly in $k$, $\alpha$ and $\beta$. With (B.3),

$$\Big| \sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\|^2 - u \Big| = |\mathrm{trace}[V_n(k)] - u| \le C(\alpha,\beta)\, n^{-1/2}. \qquad (B.9)$$

Prove now that (B2) is verified. Since $f$ is bounded away from zero and $\sum_{j\in\mathbb{Z}} |\psi_j| \le \beta < \infty$, (A.1) and (A.2) are verified with $M_n := C(r)\beta\alpha^{-1/2}n^{-1/2}$. Put $J_n = \{j, |j| \le 2n\}$. Then $\mathrm{card}(J_n) \le c_0 M_n^{-2}$ for some $c_0$ depending only on $r$, $\alpha$, $\beta$ and

J^ ||U„,,(k)f <C(r)a-2n-i J] (j^l^t+jl) 

\j\eZ\J„ \j\>2n \t=l I 

n 

|j|>2nt=l |j|>n 

Under ([XI]), iV'jl < /^Ijl"^/^"'' so that 

E b1^J < /?n-^^ 5; b-r/^+'lV'.l < P'n-^' (B.ll) 
For any $\epsilon > 0$ and large enough $n$, $\sum_{|j|>n} \|U_{n,j}(k)\|^2 \le \epsilon$ uniformly in $k$ and $\psi \in \Theta(\alpha,\beta,\delta)$. (A.3) follows from (B.9), (B.10) and (B.11). Finally,

$$\sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\| \le C(r)\alpha^{-1/2} n^{-1/2} \sum_{j\in\mathbb{Z}} \sum_{t=1}^{n} |\psi_{t+j}| = C(r)\alpha^{-1/2} n^{1/2} \sum_{j\in\mathbb{Z}} |\psi_j|$$

so that (B3) holds with $\zeta = 1$. □

Appendix C. Proof of Lemma 8

The proof is an adaptation of [Lang and Soulier, 2002] to fit our need for uniformity of the bounds with respect to the function $\psi$, whether it belongs to $\mathcal{F}_{global}$ or to $\mathcal{F}_{local}$ only. For the sake of brevity, the proof is omitted and we refer the interested reader to their paper. It derives from their more general analytical lemma, which we recall here.



Lemma 14 ([Lang and Soulier, 2002]). Let $q \in \mathbb{N}$, $K \ge 1$, $\vartheta \in (0,\pi]$. Let $\psi$ be an integrable function on $[-\pi,\pi]$, such that for all $x \in [-\vartheta,\vartheta] \setminus \{0\}$, $\psi(-x) = \psi(x)$ and

|V'(x)-V(y)l <i^^^^^^^^^|x-y|, forall{x,y) G (0,^] x (0,^] (C.l) 

X /\y 

Assume that $|\psi|$ is regularly varying at zero with index $\rho$ and that

$$\psi(x) = x^{\rho} c(x) \exp\left\{ \int_x^{\vartheta} \frac{g(s)}{s}\, ds \right\}$$

with (i) $\lim_{x\to 0} g(x) = 0$; (ii) $\lim_{x\to 0} c(x)$ exists in $(0,\infty)$. Let $D_n$ be such that, for $x \in [-\pi,\pi]$,

$$|D_n(x)| \le C \frac{n^{1/2}}{(1 + n|x|)^{1+q}}. \qquad (C.2)$$

• If $\rho \in (-1, 2q+1)$, there exists a constant $C$ such that, for all $n \ge 1$ and all $k$ such that $0 < x_k \le \vartheta/2$,

■iIj{x) 



1 iDJxk -x)\ dx 



< Clog''^'^^k)k-\ 



(C.3) 



with $\nu(0) = 1$ and $\nu(q) = 0$ if $q \ge 1$.
• If $\rho \in (-1/2, q+1/2)$, there exists a constant $C$ such that, for all $n \ge 1$ and all integers $k, j$ such that $0 < x_k \ne x_j \le \vartheta/2$,

f —7 — r - 1 j Dn{xk - x)Dn{xj - x) dx 



+ 



< 



C(l + 



■j)Mk)\)\k - jr'^ij V ky^ < C{jky'/^ (g > 0); 



C{jk)-^/Hog{jVk) ((7 = 0). 

For any $\beta > 0$, if $\rho \in (-1, 2q+1)$, for any integer $k$ such that $0 < x_k \le \vartheta/2$,



^ ip{x)\x\^ 

-TT i^iXk) 



Dn{xk -x)\'^dx 



<C7|A;-2'/-i + (^/^)/3| 



(C.4) 



(C.5) 
• If $\rho \in (-1/2, q+1/2)$, for any integers $j, k$ such that $0 < x_k \ne x_j \le \vartheta/2$,



/ —7 r-\Dnixk- x)Dn{Xj -x)\dx + / — —\Dn{xk - x)Dn{xj + x)\ dx 

< C(l + |^(x,)/V(xfc)|) {(jA;)-'^(i V kr' + U - kr^-^ij V A:)/n)'3 log-W(j V k)] 

< C[{jk)-y^ + ((j V k)/n)(' log-('^)(j V A:)} . (C.6) 

Appendix D. Proof of Theorem 6
The proof of Theorem 6 consists in checking that assumptions (B1), (B2) hold uniformly.

Lemma 15. There exist integers $N_0$, $K_0$, and $v_* > 0$, $v^* > 0$ (depending only on $\vartheta$, $\beta$, $\delta$, $\Delta$, $\mu$) such that, for all $n \ge N_0$, we have,

(1) for all $u$-tuple $k$ of distinct integers, $1 \le \min k \le \max k \le m_n$,

$$v_{\max}[V_n(k)] \le v^* ; \qquad (D.1)$$

(2) for all integer $k$, $1 \le k \le m_n$,

$$v_* \le v_{\min}[V_n(k)] ; \qquad (D.2)$$

(3) for all $u$-tuple $k$ of distinct integers, $K_0 \le \min k \le \max k \le m_n$,

$$v_* \le v_{\min}[V_n(k)] . \qquad (D.3)$$

Proof. As in Appendix B, we put $V_n(k) = V(k) + W_n(k)$, where $V(k)$ is defined in (3.5).
Applying Lemma El we obtain 

[^ under (14. Gh . 

(D.1) follows immediately. The proof of (D.3) follows by picking $N_0$, $K_0$ large enough. For (D.2), it remains to prove that, for any integer $k$, $1 \le k \le K_0$, $V_n(k)$ converges to a positive definite matrix $V(k)$ and that this convergence is uniform with respect to $\psi$, for $\psi \in \mathcal{F}_{local}(\vartheta,\beta,\delta,\Delta,\mu)$ or $\psi \in \mathcal{F}_{global}(\vartheta,\beta,\delta,\Delta,\mu)$. What follows is an adaptation of [Iouditsky, Moulines, and Soulier, 2001, Lemma 7.3]. Write
n\^r,n,k\^] = -Jrr if + I ) \Dr,n{\ " \k)?f{\)d\ =: ^1 + ^2 (D.5) 

where $D_{r,n}$ is defined in (2.2). For $n \ge 4\pi K_0/\vartheta$, $1 \le k \le K_0$ and $|\lambda| \ge \vartheta$, $|n(\lambda - \lambda_k)| \ge n\vartheta/2$.
Using ([23]) and (gT]), we get 

A2 < TmTTTT^"''"' / ^""fWdX < Cn-^' (D.6) 

H fi^k) J\\\>^n 



By change of variable, 

'|A|<m9 



„2d|-| — iAfc|2d /■ 2 \ 

^1 = " ^ " , ' / n-V2^ (A/n - A,) n-''\l - e--^'h~''n-)dX. 

/*(Afc) J\x\<n& n 
Write $\lim_{n\to\infty} n^{-1/2} D_{r,n}(\lambda/n) = (2\pi a_r)^{-1/2} \int_0^1 (1 - e^{2i\pi s})^r e^{-i\lambda s}\, ds =: h_r(\lambda)$. By Riemann approximation, it can be seen that $|n^{-1/2} D_{r,n}(\lambda/n) - h_r(\lambda)| \le C(1 + |\lambda|)/n$. Note also that $|h_r(\lambda)| \le C|\lambda|^{-r-1}$. Then



2dn „iAt|2(i rni3 



A^ 



n \1 — e 



f*{Xk) 



n-a 



hr{X-2Trk) 



\-2<i|l_eiVn|-2rf/*(^)dA 



n 



< Ck^'^n-'' < Cn-\ (D.7) 

Here and in the following, $C$ is a generic constant which depends only on $\vartheta$, $\beta$, $\delta$, $\Delta$, $\mu$, $r$ and $K_0$. For $|\lambda| \le n\vartheta$, using (4.3),

\n^)-r 



no) 

<c 



n 



'2dii _ iX/n\-2d _ I \|-2d 
^-2d|-^_giA/n|-2d_|;^|-2d 



+ 



/*(Afc) 



-n 



-2d|-| _ iX/n\-2d 



+ ^-2dM _giA/n|-2d|^|/3' 

n 



(D.8) 



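with $\beta' = \beta \wedge 1$. For $x \in [-\pi,\pi]$, $\frac{2}{\pi}|x| \le |e^{ix} - 1| = |2\sin(x/2)| \le |x|$ and $\big| |e^{ix} - 1| - |x| \big| \le x^2/2$. Also, for any $v \in \mathbb{R}$, $x > 0$, $y > 0$, $|x^v - y^v| \le |v| (x^{v-1} \vee y^{v-1}) |x - y|$. Using those relations, write, for $\lambda \in [-n\pi, n\pi]$,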



„-2d|-|_ _giA/n|-2d 



|A| 



-2d 






2d| A|_2d-1 



|l_eWn|_|^| 
n 



1| M-2(i+l 



Then 



n-d 



n-d 



\hr{2TTk-X)f 



<Cn-'\X\ 



f^-2<iM _giA/n|-2d 



|A| 



-2d 



dA 



ni? 



and 



<Cn-^ I |/i^(27rA;-A)|2|Ar2d+i^^ 

< Cn-^ I \hri2TTk - A)|2|A|2''dA < Cn'^ 

-n-d 



/n-d \ 

\hr{2TTk - A)|2n-2'^|1 - e'^/"|-2'^|-|^'dA 



(D.9) 



< n' 



-P' 



n-d 



n-d 



\hr{2TTk - X)\^\X\ 



2\x\-2d+P' 



dX 



< Cn-"'. 
Gathering (ID31) . jES]), (iDTl) . jEH), dEH), (iDTOl) yields 



(D.IO) 



E[|lx;r,n,, 



2di,2d f*((W /--l-oo 



(2^)^^A;^'^/*(0) 



/*(Afc) 
Similar arguments lead to

(27r)2'^fc2'^r(0) /■+°° 



/ir(A-27rA;) |A|"^'*dA 



< Cn 



-d' 



E[cj; 



r,n,fcJ 



/*(A;. 



2d, 



/i,.(A - 2'j:k)hr{X + 27rA:)|Ar^'*(iA 



<Cn 



-/3' 
Define the scalar product $\langle u, v \rangle_d = \int_{-\infty}^{+\infty} u(\lambda) v(\lambda) |\lambda|^{-2d}\, d\lambda$. Then $\det V_n(k)$ is uniformly approximated by the Gram determinant of the functions $h_r(\lambda - 2k\pi)$ and $h_r(\lambda + 2k\pi)$ associated with the product $\langle \cdot, \cdot \rangle_d$, and is therefore a continuous function of $\eta_k(d) := \lim_{n\to\infty} E[|\omega_{r,n,k}|^2]$ and $\eta'_k(d) := \lim_{n\to\infty} E[\omega^2_{r,n,k}]$. The whole set of functions $h_r(\lambda + 2j\pi)$, $j \in \mathbb{Z}$, is linearly independent, so that those determinants are positive. Using the continuity of $\eta_k$ and $\eta'_k$ with respect to $d$, the infimum over the compact set $[-\Delta, \delta]$ and the minimum over $k = 1, \ldots, K_0$ are positive too, which concludes the proof. □

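Lemma 16. There exists a constant $C$ (depending only on $\vartheta$, $\beta$, $\delta$, $\Delta$, $\mu$, $r$) such that for all $k \in \{1, \ldots, \tilde n\}$,

$$\left| \frac{1}{\sqrt{n f(\lambda_k)}} \sum_{t=1}^{n} h^r_{t,n} \psi_{t+j} e^{it\lambda_k} \right| \le C n^{-1/2}. \qquad (D.11)$$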



Proof. The main tool of the proof is the bound (2.3), and the technique is the same as the one used in the proof of Lemma 5. Decompose



\i;iXk)\-'^^=^hl^^l^t-,^'^'' = mXkT' [ ijiX)e''^D^,r{Xk-X)dX 



into 



Ai = |V(Afc)|- 



+ 



xl^{X)e'^^DnAh - A)rfA, 



|V'(Afc)|-V*(0) / {I - e'^ye'^^D^Ah - X)dX, 



1? 



-1 



^3 = |V'(Afc)r^ / (l-e 
-1? 



i9 



iA\— a!/„;,* 



(r (A) - rmy'^DnA^k - X)dX. 



By Eq. ^3^, if |A| G [??,^], \DnA^k - A)| < Cn~^/^~''. Note that n-^Af = n-^X^^Xf^^ < 
l/{2TTk). dUD implies that |yli| < Cn^/^'^'k'^ < Cn~^/^. Consider ^2- Since /^^ L>„,,.(A)dA = 
0, 



Ao 



A(A, Afe)Z),,,(Afc - A)dA, A(A, A^) = ((1 - e'^)"'^ - (1 - e^^^)"'^) e'^^mXkT'. 



Decompose this integral on the intervals [— ??, — Afc/2], [—Xk/2, Xk/2], [Xk/2,2Xk] and [2Xk,'&]- 
If A G [-Xk/2, Xk/2], then \Dn,r{^k - A)| < C./Ek-'^-^ and |A(A,Afc)| < C7 (jAj-'^A^ + l). 
Hence : 

"Afe/2 

A(A, Afc)£'„,r(Afc - A)dA < Ck-'n-^/^. 

-Afc/2 

If A G [A;^./2,2Afc], then |A(A,Afc)| < C (A^^jA - Afc| + l). Since j\^^{l + n\X\y'-^dX < 
Cn~^, we have 



2A,, 



Afe/2 



A(A,Afc)D„,^(Afc-A)dA 



< Cn~'/^. 
If A G [2Xk,'d] (and similarly on [-??, -Afc/2]), we use that |A(A,Afc)| < C(A~'^Af + 1) and



2A. 



A(A,Afe)Z)„,,(Afc-A)dA 






l^k~r 



Consider ^3. Under (j4.3p . we have 

|^3|<CAf/ \X\-''+P\Dn,r{^-Xk)\d\. 



Decompose this integral as above. If A € [— Afc/2, Afc/2], proceeding as above: 



\ 



Afe/2 



-Afc/2 



\\l''^^\Dn,r{\ - \k)\d\ < Cn-^/^k-'-Xl 



If Afc e [Afc/2,2Afc], Af|Ar'^+'3 < ^^^^ ^^^ j2A, |^^^^^^ _ ^^^|^^ ^ (-7^-1/2. Hence: 



-^fc 



2A,, 



Afe/2 



\X\-''+^\DnA>^ - Xk)\dX < Cn-^/^Xl 



Finally, if A G [2Afc, ^] (and similarly, if A G [— f?, — Afc/2]), we have as above: 

J2Afe JAfe 



D 



Lemma 17. There exists a constant $C$ (depending only on $\vartheta$, $\beta$, $\delta$, $\Delta$, $\mu$, $r$) such that for all $k \in \{1, \ldots, n\}$,



1 



Vnf{Xk 



Y^ /i[,„^t+,e"^* 



t=i 



< Cn-1/2 ^d-i(^ ^ |^.|)d-i < c'n-i/2((i + \j\)/nf-\ (D.12) 



Proof. By applying the definition of the weights $h_{t,n}$ and summation by parts, we have:

n r n 

t=l p=0 t=i 

n n— 1 ( / t \ / n \ ^ 



E^*+^-«'*''^' = E E«"'^- (V't+. - V't+.+i) + E 



e'"'^ V^n+j 



t=i 



t=l I, \u=l 



^«=1 



For all y G (0,7r) and aU £ G N*, E«=i e'"^ ^ 2/y. The proof follows from condition (g^J. 

D 

Proceed now with the proof of Theorem 6. If $|j| > n$, then $((1 + |j|)/n)^{\delta-1} < 1$. Hence by Lemma 16, for some constant $C$ which depends only on $\beta$, $\delta$, $\Delta$, $\vartheta$, $\mu$, $r$ and the distribution of $Z_1$,

$$\forall j, n, k, \quad M_{n,j} := C n^{-1/2} \left( 1 \wedge ((1 + |j|)/n)^{\delta-1} \right) \ge \|U_{n,j}(k)\|.$$

Note that

$$M_n := \sup_{j\in\mathbb{Z}} M_{n,j} = C n^{-1/2}.$$

Then (A.1) and (A.2) hold uniformly in $k$. By Lemma 15, Eq. (D.3) or (D.2), we have

$$\sum_{j} \|U_{n,j}(k)\|^2 = \mathrm{trace}[V_n(k)] \ge v_* > 0.$$

Finally, define for any $\gamma > 1$ the set $J_n = \{ j \in \mathbb{Z}, |j| \le \gamma n \}$. Then $\mathrm{card}(J_n) \le c_0 M_n^{-2}$ and

$$\frac{\sum_{j\notin J_n} \|U_{n,j}(k)\|^2}{\sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\|^2} \le \frac{\sum_{|j|>\gamma n} M_{n,j}^2}{\sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\|^2} \le C (v_*)^{-1} \gamma^{2\delta-1}.$$

Choosing $\gamma$ large enough yields (A.3) uniformly.
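The last step rests on the fact that the tail $\sum_{|j|>\gamma n} M_{n,j}^2$ decays like $\gamma^{2\delta-1}$, uniformly in $n$. A minimal numerical sketch of this comparison follows, with the constant $C$ set to one and hypothetical values of $n$, $\delta$ and $\gamma$; the infinite sum is truncated at `j_max`, so only a partial sum is compared with the integral bound $2\gamma^{2\delta-1}/(1-2\delta)$.

```python
def tail_weight(n, gamma, delta, j_max):
    """Partial sum of M_{n,j}^2 (with C = 1) over gamma * n < |j| <= j_max."""
    first = int(gamma * n) + 1            # smallest integer j with j > gamma * n
    total = 0.0
    for j in range(first, j_max + 1):
        # For |j| > gamma * n >= n the minimum in M_{n,j} is the second term.
        total += ((1 + j) / n) ** (2 * delta - 2) / n
    return 2.0 * total                    # j and -j contribute equally


def integral_bound(gamma, delta):
    """Comparison with the integral of x^(2 delta - 2) over (gamma, infinity), both signs of j."""
    return 2.0 * gamma ** (2 * delta - 1) / (1 - 2 * delta)


n, delta = 50, 0.3                        # hypothetical values, delta < 1/2
for gamma in (2.0, 4.0, 8.0):
    print(gamma, tail_weight(n, gamma, delta, 200_000), integral_bound(gamma, delta))
```

Since $2\delta - 1 < 0$, the bound decreases to zero as $\gamma$ grows, which is what allows the ratio in (A.3) to be made at least $\eta$.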

Appendix E. Proofs of Corollaries 3, 4, 9 and 10
Proof of Corollary 3. By the triangle inequality, the LHS of inequality (3.7) is bounded by

$$\left| E[g(S_n(k))] - \int g(x) \varphi_{V_n(k)}(x)\, dx \right| + \left| \int g(x) \left\{ \varphi_{V_n(k)}(x) - \varphi_{V(k)}(x) \right\} dx \right|.$$

By Corollary 2 with $s = 3$, the first term of the previous display is bounded by $C n^{-1/2} N_3(g)$. For a matrix $A$, denote by $\rho(A)$ its spectral radius. Denote by $I_a$ the $a$-dimensional identity matrix. To bound the second term, note that $\rho(V_n(k) - V(k)) \le C(\alpha,\beta) n^{-1}$ by (B.1) and that $\tau(g, V(k)) \ge 1$ by definition, then apply the following lemma, which is an easy adaptation of [Soulier, 2001, Theorem 2.1]. □



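Lemma 18. Let $\Gamma$ be a $u$-dimensional positive matrix. There exist $\epsilon > 0$ and a constant $C$ such that, for all symmetric positive matrices $V$ verifying $\rho(\Gamma^{-1} - V^{-1}) \le \epsilon$, and for all measurable functions $g$ on $\mathbb{R}^u$ satisfying $\|g\|_{\Gamma} < \infty$, we have

$$\left| \int_{\mathbb{R}^u} g(x) \left\{ \varphi_V(x) - \varphi_{\Gamma}(x) \right\} dx \right| \le C \rho^{\tau(g,\Gamma)/2}(\Gamma - V)\, \|g\|_{\Gamma}.$$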



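Proof of Corollary 4. The LHS of the inequality of Corollary 4 is bounded by $A_1 + A_2 + A_3 + A_4$ with

$$A_1 = \left| E[g(S_n(k))] - \int g(x) \sum_{r=0}^{s-3} P_r(x, V_n(k), \{\chi_{n,\nu}(k)\})\, dx \right|,$$

$$A_2 = \left| \int g(x) \left\{ \varphi_{V_n(k)}(x) - \varphi_{I_{2u}/2}(x) \right\} dx \right|,$$

$$A_3 = \left| \int_{\mathbb{R}^{2u}} g(x) P_1(x, V_n(k), \{\chi_{n,\nu}(k)\})\, dx \right|,$$

$$A_4 = \left| \int g(x) \sum_{r=2}^{s-3} P_r(x, V_n(k), \{\chi_{n,\nu}(k)\})\, dx \right|.$$

$A_4 = 0$ if $s = 4$. Using (3.8), we get $\tau(g, I_{2u}/2) = 2$. It follows, as in the proof of Corollary 3, that $A_2$ is bounded by $C n^{-1} \|g\|_{V(k)}$, whereas $A_1$ is bounded by $C n^{-(s-2)/2} N_s(g)$. Write shortly $P_r(x, V_n(k), \{\chi_{n,\nu}(k)\}) = R_r(x) \varphi_{V_n(k)}(x)$, where $R_r$ is a polynomial of order $r + 2$ (the dependence with respect to $V_n(k)$ and $\{\chi_{n,\nu}(k)\}$ is omitted in this notation). Note also that

$$|\chi_{n,\nu}(k)| \le |\kappa_{|\nu|}| \sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\|^{|\nu|} \le |\kappa_{|\nu|}| M_n^{|\nu|-2} \Big( \sum_{j\in\mathbb{Z}} \|U_{n,j}(k)\|^2 \Big) \le |\kappa_{|\nu|}| M_n^{|\nu|-2}\, \mathrm{trace}[V_n(k)] \le C |\kappa_{|\nu|}| M_n^{|\nu|-2} \qquad (E.1)$$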
where $M_n \le C(\alpha,\beta) n^{-1/2}$. Then, the coefficients of $R_r$ are $O(n^{-r/2})$ uniformly in $k$ and $\psi$, since they involve $\chi_{n,\nu}(k)$'s with $|\nu| = r + 2$ and elements of $V_n^{-1}(k)$ [for details, see Bhattacharya and Rao, 1976]. Let $F_n(k) := (V_n^{-1}(k) - V^{-1}(k))/2$ and write

$$\int g(x) \varphi_{V_n(k)}(x) R_r(x)\, dx = \left( \frac{\det V(k)}{\det V_n(k)} \right)^{1/2} \int g(x) R_r(x) \exp\{-x' F_n(k) x\}\, \varphi_{V(k)}(x)\, dx. \qquad (E.2)$$

By dE]), ||F„(k)||i < Cn-i and | det(V„(k))-i/2 - det(l2„/2)-V2| < Cn'^ uniformly so 
that A4 < Cn~^ 11(1 + ||x||'*)5(x) 11212^/3. We can derive this way that ^43 < Cn^-*^'^ which is 
not enough. Improving this bound requires some care and uses the symmetries of g. Actually, 
i?i is a sum of polynomials which are odd with respect to one or three components. Write 

$$\left| \exp\{-x' F_n(k) x\} - 1 + x' F_n(k) x \right| \le C n^{-2} \|x\|^4 \exp\{ C n^{-1} \|x\|^2 \} \qquad (E.3)$$

and notice that $\{1 - x' F_n(k) x\} R_1(x)$ is a sum of polynomials of the form $\prod_i r_i(x_{2i-1}, x_{2i})$, each of them being odd with respect to at least one variable. Consider a typical term odd with respect to $x_1$, say. Using (3.8)



/ 5(x)TTri(3;2i_i,X2i)(^i2 /2(x)dx = / gi{xi,X2)ri{xi,X2)<fi^/2{xi,X2)dxidx2 

^ / Y\gi{x2i-l,X2i)ri{x2i-l,X2i)(pi2^_^/2{x.)dx3--- dX2u = 0, 

since the first integral vanishes. Hence, L4 /i(x)i?i(x)(x'F„(k)x)93i^(x) dx = 0. Gathering 
(lEljl . dES) and JES]), A3 < Cn-"^. D 
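The bound (E.3) is an instance of the elementary inequality $|e^{-t} - 1 + t| \le \frac{t^2}{2} e^{|t|}$, applied with $t = x' F_n(k) x$ and $|t| \le C n^{-1}\|x\|^2$. The sketch below checks this scalar inequality on a grid; the grid range is an arbitrary choice made for illustration.

```python
import math


def taylor_remainder(t):
    """|exp(-t) - 1 + t|, the second-order Taylor remainder of exp(-t) at 0."""
    return abs(math.exp(-t) - 1.0 + t)


def remainder_bound(t):
    """Upper bound (t^2 / 2) * exp(|t|) on the remainder."""
    return 0.5 * t * t * math.exp(abs(t))


# Grid of t values in [-5, 5]; t = 0 is skipped because both sides vanish there.
grid = [k / 100.0 for k in range(-500, 501)]
worst_ratio = max(taylor_remainder(t) / remainder_bound(t) for t in grid if t != 0.0)

print(worst_ratio)   # stays below one on the whole grid
```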

Proofs of Corollaries 9 and 10. As those corollaries are the counterparts of Corollaries 3 and 4
in a long memory context, we only give the necessary adaptations from the preceding proofs.
From LemmaEl p(Vn{k) -V{k)) < Cp{k,k,n,p), ||F„(k)||i < Cp{k,j,n,l3) and 

|det(V„(k))^i/2 -det(V(k))^V2| < Cp{k,j,n,P). 

The LHS of dESl) is now bounded by p2(A;,j,n,/3) ||x||^ exp {Cp(/c,j,n,/3)||x||2}. The term A3 
is then bounded by 

C7n~^/V(fc,j,n,/3) / ||xf /i(x) exp{-||xf (1 + C7j,(A:, j>,/3))} dx. 

If m = o{n) and Ki > 2C, then for large enough n and Ki <k<j — r<m — r, the integral 
is uniformly bounded. Thus A3 < Cn^^''^p'^{k,j,n, (3) whereas Ai < Cp'^{k,j,n,P). D 

Appendix F. Proof of Theorem 11

In the sequel, $C$ denotes a constant which depends only on $\beta$, $\delta$, $\vartheta$, $\mu$ and the distribution of $Z_1$ and whose value may change upon each appearance. Note first that $|\nu_k| = O(\log(k))$, $s_m^2/m \to C > 0$ (see for instance [Robinson, 1995b] or [Hurvich, Deo, and Brodsky, 1998]). Define $f^*(\lambda) = |1 - e^{-i\lambda}|^{2d} f(\lambda)$ and $L(\lambda) = \log(f^*(\lambda)/f^*(0))$. Since $\psi \in \mathcal{F}(\vartheta,\beta,\delta,\Delta,\mu)$, there exists a constant $C$ such that

$$\forall k \in \{1, \ldots, m\}, \quad |L(\lambda_k)| \le C |\lambda_k|^{\beta}. \qquad (F.1)$$

Let $\eta$ denote $E(\log \|Y\|^2)$ where $Y$ is a centered Gaussian random vector with covariance matrix $I_2/2$. Define $\eta_k = \log(I_k / f(\lambda_k)) - \eta$, $1 \le k \le m$. With these notations and since $\sum_{k=1}^{m} \nu_k = 0$, the definition of $\hat d_m$ yields

$$\hat d_m = d + s_m^{-2} \sum_{k=1}^{m} \nu_k \eta_k + s_m^{-2} \sum_{k=1}^{m} \nu_k L(\lambda_k) =: d + W_m + b_m. \qquad (F.2)$$

The mean-square error of the GPH estimator is $E((\hat d_m - d)^2) = E[W_m^2] + 2 b_m E[W_m] + b_m^2$. Applying (F.1) and the Cauchy-Schwarz inequality,

$$|b_m| \le C s_m^{-2} \sum_{k=1}^{m} |\nu_k| \lambda_k^{\beta} \le C (m/n)^{\beta}. \qquad (F.3)$$
fc=i 

Thus, to prove Theorem 11, we only need to show that $E[W_m^2] \le C m^{-1}$. We now compute $E[W_m^2]$. Let $\ell = \ell(m)$ be a non-decreasing sequence of integers such that $1 \le \ell \le m$ and define $W_{1,m} = s_m^{-2} \sum_{k=1}^{\ell} \nu_k \eta_k$ and $W_{2,m} = W_m - W_{1,m}$. We first give a bound for $E[W_{1,m}^2]$. Note that

$$E[W_{1,m}^2] \le \ell\, s_m^{-4} \sum_{k=1}^{\ell} \nu_k^2 E[\eta_k^2]. \qquad (F.4)$$

For $x \in \mathbb{R}^2$, define $g(x) = \log(\|x\|^2) - \eta$. Then $\eta_k = g(S_{n,k})$ and $N_3(g^2) < \infty$. For $(x_1, \ldots, x_4) \in \mathbb{R}^4$, define $h(x_1, \ldots, x_4) = g(x_1, x_2) g(x_3, x_4)$. Then $\eta_k \eta_j = h(S_n(k,j))$, $h$ has property (3.8) and

Jf,4 i + ||x||3 

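where we have used $4(1 + (a^2 + b^2)^{s/2}) \ge (1 + |a|^{s/2})(1 + |b|^{s/2})$. Note that $N_4(h) = N_2(g) = +\infty$, which motivates the expansion up to order $s = 5$. Let $\sigma^2 = \mathrm{var}(\log \|Y\|^2) = \pi^2/6$. Applying Corollaries 9 and 10 respectively to the functions $g$, $h$, we get for some integer $\ell_0$ and any $k, j$ such that $\ell_0 \le k < j \le m$,

$$\left| E[\eta_k^2] - \sigma^2 \right| \le C(\beta,\delta,\vartheta,\mu) \left\{ k^{-1} + (k/n)^{\beta} + n^{-1/2} \right\} \qquad (F.5)$$

$$\left| E[\eta_k \eta_j] \right| \le C(\beta,\delta,\vartheta,\mu) \left\{ k^{-2} + (j/n)^{2\beta} + n^{-1} \right\}. \qquad (F.6)$$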
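The constants $\eta = E(\log\|Y\|^2)$ and $\sigma^2 = \pi^2/6$ can be checked by simulation. When $Y$ is centred Gaussian with covariance $I_2/2$, $\|Y\|^2$ is a standard exponential variable, so $\eta$ equals minus the Euler-Mascheroni constant; this closed form is a standard fact and is not stated in the text. The following Monte Carlo sketch uses a hypothetical sample size and seed.

```python
import math
import random

EULER_GAMMA = 0.5772156649015329   # Euler-Mascheroni constant


def simulate_log_norm_sq(n_draws, seed=0):
    """Draws of log(||Y||^2) for Y centred Gaussian with covariance I_2 / 2."""
    rng = random.Random(seed)
    sd = math.sqrt(0.5)             # each coordinate has variance 1/2
    draws = []
    for _ in range(n_draws):
        y1, y2 = rng.gauss(0.0, sd), rng.gauss(0.0, sd)
        draws.append(math.log(y1 * y1 + y2 * y2))
    return draws


draws = simulate_log_norm_sq(200_000)
mean_hat = sum(draws) / len(draws)
var_hat = sum((x - mean_hat) ** 2 for x in draws) / (len(draws) - 1)

print(mean_hat, -EULER_GAMMA)       # empirical eta against -gamma
print(var_hat, math.pi ** 2 / 6)    # empirical sigma^2 against pi^2 / 6
```

The empirical values agree with the closed forms only up to Monte Carlo error, of the order of a few thousandths for this sample size.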

dEl and dEH) yield EfVFf^^] < Cfm-^. We now bound E[VF|^]: 

$$E[W_{2,m}^2] = s_m^{-4} \sum_{k=\ell+1}^{m} \nu_k^2 E[\eta_k^2] + 2 s_m^{-4} \sum_{\ell < k < j \le m} \nu_k \nu_j E[\eta_k \eta_j].$$

Using (F.5) and (F.6), we obtain



|lE[W^|J-s;;V|<C(/?,5,79,/.)s-4 Y ul(k-^ + {k/nf + n-^/^^ 

k=e+i 

i<k<j<m 

=C{P, 6, 7?, //)C {1 + (r 1/2 + „^V2/-3/2 ^ ^2/3+1^-2/3 ^ ^/^^ I ^ 

(F.7)

Choosing $\ell \le m$ such that $\ell^2 = o(m)$ and $m = o(\ell^3)$ (for instance $\ell = [m^{\eta}]$ with $1/3 < \eta < 1/2$) yields $E[W_m^2] = O(m^{-1})$. This bound and (F.3) conclude the proof of Theorem 11.